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Abstract. Passive optical networks are increasingly used for access to the 
Internet and it is important to understand the performance of future long- 
reach, multi-channel variants. In this paper we discuss requirements on the 
dynamic bandwidth allocation (DBA) algorithm used to manage the upstream 
resource in a WDM EPON and propose a simple novel DBA algorithm that is 
considerably more efficient than classical approaches. We demonstrate that the 
algorithm emulates a multi-server polling system and derive capacity formulas 
that are valid for general traffic processes. We evaluate delay performance 
by simulation demonstrating the superiority of the proposed scheduler. The 
proposed scheduler offers considerable flexibility and is particularly efficient in 
long-reach access networks where propagation times are high. 



1. Introduction 

The development of dynamic optical switching is widely recognized as an essen- 
tial requirement to meet the anticipated growth in Internet traffic. Already, passive 
optical networks (PONs) are in use in many countries to provide high speed ac- 
cess and penetration is expected to increase rapidly in coming years. In this paper 
we consider the performance of the IEEE standardized EPON and its likely de- 
velopments where fiber capacity is multiplied by the use of wavelength division 
multiplexing (WDM) and reach is extended by the use of optical amplifiers. 




up to 100 km 



(long reach EPON) 



Figure 1. EPON components: ONUs, passive splitters and OLT 
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Figure [T] illustrates the main components of the EPON. Optical network units 
(ONUs) are situated in or close to user premises. An ONU might be dedicated to 
an individual domestic or business user or, when fiber is terminated at the curb 
or at the building, it could concentrate the traffic of several users. The ONUs of 
the EPON are controlled from an optical line termination (OLT) equipment that 
realizes the interface with the electronic packet switched network and manages 
downstream and upstream traffic. Passive splitters are used to broadcast down- 
stream optical signals to the ONUs and to merge upstream signals destined to the 
OLT. There may be more than one level of splitters, notably in proposed long-reach 
PONs where the distance between central office and user can attain f 00 km. The 
number of ONUs per EPON is currently limited to 64 for an EPON of upstream 
and downstream rate of 1 Gb/s. Future WDM EPONs will clearly have greater 
capacity. 

Each ONU receives all downstream traffic on the wavelengths to which it has 
access and extracts its own packets based on the destination address. Upstream 
traffic management is more complicated. To avoid collisions, the ONUs must co- 
ordinate their upstream transmissions to the OLT. In EPON this coordination is 
performed by means of a dynamic bandwidth allocation (DBA) algorithm. The 
EPON implements a protocol allowing the ONUs to communicate their current 
requirements for upstream transmission time and the OLT to assign the necessary 
slots. The DBA thus emulates a kind of polling system. There are, however, a 
number of particularities that prevent us from directly applying known results on 
the performance of polling systems. 

The physical separation of user queues (in the ONUs) and their controller (in the 
OLT) imposes a form of gated service with a variable delay between request and 
grant. This delay depends on the propagation time and is particularly significant 
for a long-reach EPON where the distance between OLT and ONU can attain 100 
km. This leads to one-way propagation times of up to 500 fxs which is significant 
compared to the transmission time of an Ethernet packet of only 12 /is at 1 Gb/s. 
It is necessary to compensate for the different propagation times of different ONUs. 
To ensure the polling cycle remains low even when some ONUs have high load, the 
DBA must impose an upper limit on the size of each grant thus realizing a kind of 
limited gated polling. 

DBA algorithms may be distinguished as offline or online depending on whether 
or not the OLT waits to receive all requests before deciding on allocations and 
dispatching the grants. Offline DBAs allow tighter control and can implement 
sophisticated bandwidth sharing policies. It can, however, lead to excessive delays 
when propagation times are large since the schedule can only be computed once in 
every round trip time for the most distant ONU. Online DBAs are more efficient 
since the grant can be computed and sent as soon as the request is received. Most 
online algorithms described in the literature generate a grant explicitly for every 
request received. This policy can also lead to wasted capacity when propagation 
times are long since the service cycle is still at least as long as the longest round 
trip time. An alternative, that we explore in this paper, is for the OLT to issue 
grants spontaneously based on its current knowledge of ONU requirements. This 
is acquired from requests sent with upstream data sent in response to preceding 
grants. Polling is thus driven by grants rather than requests. After the names of 
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the grant and request messages in EPON, we refer to these alternatives as GATE- 
driven and REPORT-driven scheduling, respectively. GATE-driven scheduling is 
simple to implement, compatible with standards and, to our knowledge, novel. 

In the paper we describe GATE-driven scheduling and derive the traffic capacity 
it delivers for a classical single wavelength EPON and for a WDM EPON where 
ONUs access all wavelengths with full availability. Traffic capacity is defined as 
the load beyond which the underlying polling system would be unstable. We can 
determine this analytically under general traffic assumptions by adapting known re- 
sults for multi-server polling systems. We evaluate delay performance by simulation 
and demonstrate that the GATE-driven algorithm clearly outperforms the classical 
REPORT-driven online scheduler. However, with the high transmission speed of 
optical fiber, delay performance is usually excellent until the network stability limit 
is attained. Traffic capacity is therefore the essential performance parameter for 
the EPON. Exact capacity results are more difficult to derive for WDM networks 
where ONUs have limited availability to the range of wavelengths. We propose and 
evaluate some approximations. 

Related work. There is an abundant literature on EPON and the numerous DBA 
algorithms that have been proposed. Two recent surveys, by McGarry el al [TB] and 
by Zheng and Mouftah [24 , present a comprehensive review. Skubic et al. compare 
DBAs used in EPON with alternatives proposed for GPON, the alternative ITU 
standard. A significant early development was the intcr-lcavcd polling algorithm 
IPACT proposed by Kramer et al. [11] . Most subsequently proposed DBAs seek to 
realize more complex sharing policies that, as discussed in the next section, we tend 
to consider less important than efficiently using the medium with low packet level 
latency. The algorithm we propose is closer to the original IPACT with limited 
gated service discipline. 

DBA for the WDM EPON, sometimes called dynamic wavelength and band- 
width allocation (DWBA), has received much less attention. An early proposal 
by Kwong et al. is a WDM extension of IPACT using REPORT-driven schedul- 
ing 1 12). McGarry and various co-authors have notably compared offline and online 
DWBA algorithms [T71 [TBI [H] showing the superiority of the latter in terms of 
both capacity and packet delay. A quite complex DWBA algorithm is proposed by 
Dhaini et al. for a hybrid TDM/WDM EPON [5]. This is further developed and 
applied by Meng et al. in the context of a proposed joint access and metropolitan 
optical network architecture called STARGATE [JJ5] Particularly relevant to the 
present work is a DBA proposed for a long reach GPON by Song et al. [21 . The 
authors recognize the resource waste caused by classical REPORT-driven schedul- 
ing and suggest the ONUs should initiate multiple interleaved cycles or threads. 
We believe our GATE-driven proposal is more efficient and offers greater flexibility. 

Most performance evaluations of DBA algorithms rely on simulation. A few 
authors have developed analytical models generally using techniques developed for 
polling systems. Park et al. derived closed form formulas for the mean packet delay 
in a symmetric PON with identical ONUs and negligible propagation times under 
the assumption of Poisson traffic and gated service [20]. The two-stage polling 
system identified in that paper has been further analyzed by van der Mei and 
Resing [531 H3] when ONUs have heterogeneous load. A paper by Aurzada et al. 
demonstrates that classical polling models do not apply when propagation times 
are significant and the EPON scheduler is REPORT-driven [1]. This is because 
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"switchover times" as the OLT successively sends grants to the ONUs are not then 
independent of the ONU queue sizes. This difficulty is removed with GATE-driven 
scheduling, as explained later. Moreover, since we focus on traffic capacity and 
not delay, we can make use of known results on the stability of multi-server polling 
systems that are applicable under quite general traffic assumptions. 

The rest of the paper is organized as follows. In the next section we highlight 
some essential features of the EPON architecture. We then proceed in Section [3] to 
describe the GATE-driven scheme and to prove its claimed performance properties 
in the case of the classical single wavelength EPON. Section [4] is devoted to the 
WDM EPON. We derive the traffic capacity when the ONUs can use all wave- 
lengths and discuss corresponding results when they have limited availability. The 
penultimate section is devoted to the presentation of simulation results where we 
numerically compare the delay performance of alternative DBA configurations. 

2. WDM EPON ARCHITECTURE 

We outline some basic architectural features deriving from technology and EPON 
standards before briefly discussing QoS. 

2.1. Technology. The potential for WDM PON technology to realize a cost ef- 
fective, high capacity access network is only beginning to be seriously explored. In 
particular, the issue of the number of wavelengths that can be economically main- 
tained is hardly discussed. Although dense WDM (DWDM) is capable of creating 
hundreds of channels on a single fibre, we follow most works in assuming PONs will 
exploit a relatively small number of wavelengths. 

An important issue is how the channels are shared. In several proposals, each 
ONU in a WDM PON is assigned a fixed pair of upstream and downstream wave- 
lengths With just one upstream wavelength per ONU the network is equiv- 
alent in terms of traffic capacity to a set of independent TDM PONs though some 
proposed architectures facilitate load balancing [8]. 

Maximum traffic capacity is obtained when all ONUs have full access to all 
wavelengths. However, this requires each ONU to be equipped with a costly array 
of fixed wavelength transceivers and network evolution to meet increasing demand 
is hard [T3]. Nevertheless, this architecture constitutes a useful benchmark for 
measuring the performance of more cost effective alternatives. 

Tunable transmitters exist that allow a source to switch wavelengths within a 
few nanoseconds. An ONU equipped with a tunable transmitter could thus access 
all available channels but use only one at any given time. A similar architecture 
results from the use of a reflective semi-conductor optical amplifier (RSOA) in the 
ONU [Hj . The OLT transmits downstream a carrier burst on a chosen wavelength 
that is modulated by the ONU before being reflected back upstream to the OLT. 
The ONU is then particularly simple and inexpensive. 

More generally, as advocated in [14 , to allow progressive buildout and maximize 
flexibility, future WDM PONs should support ONUs with varied technologies and 
capabilities. Some ONUs might have one or several fixed upstream wavelengths, 
others one or more tunable transmitters. 

2.2. MPCP. EPON upstream medium access control is based on the multi-point 
control protocol (MPCP) standardized by IEEE (802. 3ah). We consider the WDM 
extension to MPCP proposed by McGarry et al. [17] . 
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FIGURE 2. Format of REPORT and GATE messages with WDM 
extension from [T7] 

In addition to registration and configuration, the protocol fulfills the following 
three essential functions through the exchange of so-called GATE and REPORT 
packets (see Figure [2]): 

- the exchange of timing information allowing clock synchronization and pre- 
cise measurement of the propagation time for each ONU; 

- reporting to the OLT the current contents of ONU queues; 

- granting transmission opportunities to the ONUs based on these reports. 

The way REPORT and GATE messages are used to realize DBA is implementa- 
tion dependent. As discussed in the introduction, we distinguish offline and online 
scheduling and REPORT-driven and GATE-driven online scheduling. Our proposal 
is to implement GATE-driven online scheduling, as described in Section [3] below. 

2.3. QoS. An important function of DBA is traffic management and the fulfillment 
of service level agreements (SLAs) expressing QoS requirements. Experience shows 
that it is, to say the least, very difficult to efficiently satisfy a range of diverse 
SLAs in any multiservice network. On the other hand, the high capacity of optical 
access networks makes it rather easy to ensure excellent quality for all, except in 
situations of overload. In this context, we believe the following simple WDM PON 
traffic management strategy is adequate. 

The OLT ensures ONUs receive a "fair" bandwidth share, by regularly issu- 
ing GATE messages to each ONU. The issuing rate might vary from one ONU 
to another to perform inter-ONU differentiation if required. Each GATE grants 
upstream transmissions depending only on the ONU's own reports, implementing 
an upper limit that might be ONU dependent. The ONU is left to manage its 
own intra-ONU service differentiation, using the grants as appropriate to meet la- 
tency and throughput requirements of traffic classes, end users, application flows 
or particular services. The DBA ensures cycle times between successive grants is 
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Figure 3. GATE-driven schedule with 2 ONUs 

small enough to satisfy the most stringent latency requirements. We do not further 
consider intra-ONU scheduling in this paper. 

The GATE-driven DBA described next fulfills these objectives. We note, how- 
ever, that GATE-driven scheduling would also be advantageous for more sophisti- 
cated QoS architectures. 

3. GATE-DRIVEN SCHEDULING 

We introduce the notion of GATE-driven scheduling in the context of the classical 
EPON with one wavelength in each transmission direction shared by a group of N 
ONUs. 

3.1. Algorithm. GATE-driven scheduling decouples GATEs from REPORTS, al- 
lowing the OLT to poll ONUs according to a freely defined periodic sequence. Let 
5i denote the one way propagation delay between the OLT and ONU i. For the 
sake of simplicity, we generally assume the schedule for emitting GATEs is round 
robin. The n th GATE sent to ONU i has the nominal epoch gi(n) and instructs the 
ONU to send data for duration di(n) starting at time Si(n). REPORT messages 
are piggybacked at the end of a granted data transmission starting, therefore, at 
tme Si(n) +di(n) or, if di(n) = 0, sent as an isolated packet at time Si(n). Let ri(n) 
denote the arrival time of the REPORT at the OLT. Duration di(n) is determined 
from previously received REPORTS from ONU i. Each REPORT informs the OLT 
of data arrivals to the ONU queues since the time of the last REPORT while each 
GATE takes account of the latest information received (see Figure [3]) . 

We apply a form of limited gated polling. Let the residual amount of time 
necessary to transmit the currently known backlog of ONU i at time t be Ri(t). 
This is determined from previously received REPORTS and previously sent GATEs. 
The n th grant is upper bounded by d™ ax such that 

di(n) = min{R i (g i (n)),d? ax }. 

Note that, if d™ ax = oo, the ONU can send all traffic requested and we have a 
form of pure gated polling. In practice, GATE messages cannot always be sent 
at their designated time gi(n) since they share the downlink with data packets. 
Data are sent in encapsulated Ethernet frames. We assume the OLT schedules 
GATE messages with non-preemptive priority. The actual emission time can then 



UPSTREAM TRAFFIC CAPACITY OF A WDM EPON 



7 



differ from the desired epoch gi by up to the transmission time of a data packet. 
Assuming an MTU of 1500 bytes and a transmission rate of 1 Gb/s, the maximum 
delay is slightly more than 12 /lis. Denote this time by r. 

The time to send a REPORT message and the ensuing inter-packet guard time 
is Afl. The corresponding time for GATE messages is Aq. The guard time for 
upstream transmissions is around 1.5 /i yielding Ar « 2\is for a 1 Gb/s EPON. 
The downstream overhead Aq is smaller since all data is broadcast and guard times 
are not required. We later need the assumption A G < Ar. 

In the following, we suppose the ONU can always fully comply with the allocated 
grant di. In practice this may not be the case since Ethernet frames cannot be 
fragmented and the sum of the sizes of transmitted packets may need to be less 
than di. This will arise notably when dj = d" lax or when the ONU sends recently 
arrived, higher priority packets that were not accounted for in REPORT messages. 
Such under-filling leads to a loss of traffic capacity that might be as high as 15% 

ma- 



3.2. Efficiency. We seek to define the sequences g and s so that the upstream 
wavelength is always busy either transmitting data or REPORTS. Figure [3] illus- 
trates a fragment of such a schedule for an EPON with two ONUs. The following 
proposition provides a recursive schedule that realizes this objective. 

Proposition 1. The following recursions define a feasible schedule and ensure the 
upstream link is fully utilized: 

(1) 9i{n) = ffi-i(n) +di-i(n) + A R , 

(2) Si (n) = gi (n) + A G + A - Si. 

for 1 < i < N, n = 1, 2, . . . where Ao is an offset satisfying A<j > 2max{<5i} + r, 
5o(l) = do(l) — and, by convention, quantities cycle as go(n + 1) = gt<i{n). 

Proof. Feasibility requires 

(3) gi(n) + A G + t < Si(n) - Si, 

i.e., the GATE must arrive at the ONU before the scheduled start time accounting 
for maximal delay, and 

(4) gi(n) >ft_i(n)+A Gl 

i.e., the GATE for ONU i cannot be emitted before the end of the preceding GATE. 
Condition ^ follows from Q and A > 28 i + r while Q follows from Q, the 
assumption A^ > A G (see Sec. |3.1[ ) and the fact cf,(n) > 0. 
The uplink is always busy since 

(5) s i+ i(n) + 8 l+ i = Sj(n) + d l (n) + A R + Si, 

the arrival time at the OLT of the first bit of the n th transmission from ONU i + 1 
(l.h.s) occurs immediately after the guard time following the arrival of the last bit 
of the n th transmission from ONU i (r.h.s). This equation follows directly from |l]) 
and (2). □ 
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3.3. Traffic capacity. The EPON under GATE-driven scheduling behaves like a 
single server polling system with fixed switchover time A# with a non-standard 
service discipline. The grants do not relate to current queue occupancy but to 
the occupancy observed at some random time in the past as informed by the last 
REPORT to be received at the OLT. Figure [4] illustrates the interlacing of GATE 
epochs gi(n) with REPORT epochs ri(n') where, to simplify, we ignore durations 
Aq and A#. Note that the number of REPORTs arriving between gi(n — 1) and 
<7i(n) is random. It can take values between zero and an upper bound corresponding 
to the maximum cycle time ^(d™ a:E + Afl) divided by the minimum inter-REPORT 
gap A R . Known stability and cycle time results for periodic polling systems under 
a wide class of monotonic service disciplines carry over to this case [B] . 

Proposition 2. The EPON with GATE-driven scheduling is stable iff 



(6) 



t{ Pl /dr x }s < i, 



where pi is the ONU i load, p = pi and S = NAr is the sum of switchover 
times in one cycle. When the system is stable, the mean cycle time is 

S 



(7) 



C = 



1 



Proof. It is intuitively clear that the random offset between GATEs and corre- 
sponding REPORTs in no way changes the stability conditions of the underlying 
polling system. Condition ^ thus follows directly from the corresponding result 
for periodic polling systems proved in [6]. Alternatively the proposition may be 
seen as a simple corollary of Proposition [3] proved in Section [3] below. 
The mean cycle time can be written 



(8) 



C = ^E[di] + S. 



Now, di derives from a variable number of REPORTs received in previous cycles, 
as illustrated in Fig. [4j However, we do know there is one REPORT for every 
GATE and, when the system is stable, all reported data is eventually granted. 
We conclude that E[dj] must be equal to the expected amount announced in an 
arbitrary REPORT. Applying Little's law, this is just piC. Formula ([T]) follows on 
making the substitution in ([8]). □ 



The fact that the grant sizes are limited is crucial to allow to other ONUs to 
remain stable even if some ONUs are unstable because of their load. This is called 
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local stability in [6]. Let the ONUs be numbered such that p\ < ... < pn and 
define pj — Yj*i=i Pi as the overall load of ONUs 1, . . . ,j. Let Sj be the sum of 
the switchover time in one cycle and the deterministic maximum grants of ONUs 
j + 1, .. ., N during one cycle. The sequence (pj + PjSj/dj)j is increasing. Define 

k = max-jj, pj + < !}■ 

ONUs k + 1, . . . , N are then unstable while ONUs 1, . . . , k behave like a polling 
system with overall load p\ and switchover time during a cycle S^. 

Similarly, the formulas remain valid for non-homogeneous switchover times if S is 
interpreted as the sum of switchover times in one cycle. This would arise if an ONU 
reported independent queue sets that are granted separately in one GATE message 
with therefore zero switchover times (up to four separate grants are possible: see 
Fig. [2]). Finally, the periodic cycle does not need to be round robin. Some ONUs 
may be visited more frequently than others in each cycle, for example. We then 
need to calculate the total cycle switchover time 5* and to interpret d™ a:c as the 
overall maximum service time for ONU i in one cycle. 

3.4. Flexibility. As noted above, GATE-driven scheduling allows the definition 
of arbitrary service orders and can therefore offer differentiated service rates to 
different ONUs. This might be useful when ONUs have different traffic levels or 
their owners/users have more or less costly subscriptions. Of course, increasing the 
visit frequency only increases throughput in case of saturation but it does reduce 
latency. 

GATE-driven scheduling relies on repeatedly asking every ONU if it has packets 
to send, even if the ONU is persistently idle. Overhead might be reduced, thereby 
extending the capacity region, by reducing the polling frequency for ONUs that 
currently have no traffic. One possibility would be for the OLT to skip an ONU 
for x cycles if it persistently reports no traffic. The number x might grow (double, 
say) as the number of successively void reports increases. It would, of course, 
be necessary to maintain a minimal visit rate to avoid undue delay for an ONU 
that becomes active and to ensure latency remains acceptable for an active ONU 
transmitting at low rate. GATE-driven polling facilitates such flexible scheduling. 

4. Capacity of a WDM PON 

In this section we extend the GATE-driven performance results to a WDM EPON 
where ONUs have a full array of fixed wavelength transmitters. This solution is 
likely to be too expensive, however, and we therefore envisage an alternative where 
each ONU is equipped with a smaller number of fixed transmitters or a single 
tunable transmitter. The number of wavelengths is denoted by L. 

4.1. Full availability. Each ONU can transmit without restriction on any wave- 
length. We distinguish two scheduling policies where the OLT implements either 
a separate but identical service cycle for each wavelength or a single cycle for all 
wavelengths. We assume the cycle in both cases is round robin. 

4.1.1. One cycle per wavelength. The GATE epochs and transmission start times 
are determined for each wavelength using the recursions ([l} and pj). It follows 
from Proposition [T] that these schedules are feasible and efficient. Traffic capacity 
is given by the following proposition. 
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Proposition 3. The full availability WDM EPON with GATE- driven per-wavelength 
scheduling is stable iff 

(9) p + max( Pl /dT ax )S < L, 

i 

where S = NAr is the total switchover time per cycle. When the system is stable, 
the mean cycle time for each wavelength is 

(10) C = LS/(L- p). 

Proof. Let Wj(t) be the queue content of ONU i at time t (expressed as the time 
necessary to transmit this content at the wavelength rate) and define X(t) to be 
the process {Wi(t), . . . , Wjv(i), Y(t)} where vector Y(t) collects the supplementary 
variables necessary to make X(t) Markovian. Specifically, Y(t) has the following 
components 

- for each wavelength: the next ONU in the cycle to be sent a GATE message; 
the epoch, start time and grant of the last GATE message, 

- for each ONU: the time since the last arrival; the epochs and values of the 
last n reports where n = \J2 d" lax + NAr) /Ar] . 

The latter quantity is chosen to ensure the process records all REPORTS that may 
still be relevant to compute the next grant. 

The process X(t) being Markovian, the fluid limit approach is used to demon- 
strate the sufficiency of See [3], for example, and [7] in the case of multi- 
server polling systems. Fluid limits Wi(t) of the state of the queues at ONU i for 
1 < i < N is defined by 

W t (t) = lini j^Wi(\w\t), 

|tu|-K>o \W\ 

where w = (Wi(0)). Since only the components (W,(t)) of (X(t)) may can grow 
without bound, the Markov process will be shown to be ergodic if there exists some 
T > such that Wi(T) = holds for all i, independently of its initial value Wj(0) 
(see Theorem 4.16 in [3]). 

Let G\ (t) be the total amount of transmission time granted to ONU i by wave- 
length I before t with 

G l = ^2G\, G i = J2 G l imdG = Y, Gi = Ys Gl - 

i I i I 

Let V!(i) be the number of separate GATE messages sent to ONU i before t for 
wavelength I with corresponding sums V , Vi and V (the wavelength "visits" the 
ONUs). Since all wavelengths realize the same round robin schedule, for the fluid 
limit we have, for t > and 1 < I < L, 

(11) V\(t)=V\t)/N, 

i.e., every ONU is visited with the same relative frequency by every wavelength. 

Assume that at time t, some ONUs in the fluid system still have a backlog 
of work. Let J-(t) be the subset of ONUs that still have a fluid backlog at t, 
T(t) = {i:W l {t)>0}. 

Denoting time derivative with a dot, recalling that fluid limits are almost every- 
where differentiable, we have 



(12) 



W(t)=p-G(t), 
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where W(t) = J2i Wi(t). We now proceed to express G(t) in two ways. First, since 
the wavelengths are fully utilized, 

(13) t(t)=L-A R V(t), 

i.e., the useful transmission rate is the total capacity minus the switch overhead. 
Second, considering the rate at which data is transmitted, 

(14) 5<(t) = Pt , for i i T(t), 

(15) GM = dr*Vi(t), for i G Tit). 



Summing relations (14 1 and ( |15[ ), we have: 

(i6) = P - e E c^to/^ 

»&F(t) ieJF(t) 



and substituting in (13) and (12) yields 



Equating the r.h.s of (16) with that of (13), we have an equation for V(t). Solving 



(17) W(t) 



Now, due to condition Q, any term of the sum in the numerator of the r.h.s. 



of (17) is negative. This implies in particular that there exists some T such that 
W{T) — independently of the initial value of W(0). The Markov process (X(t)) 
is therefore ergodic. 

To prove the condition is necessary we proceed as in [7]. Note that when the 
system is stable, by symmetry, the mean time between two consecutive visits of any 
wavelength to a given ONU is the same. This is the expected cycle time, C. Let D\ 
be the expected overall transmission time granted to ONU i by wavelength / in a 
cycle that, for definiteness, starts and ends with the visit of wavelength 1 to ONU 
1. By symmetry, D\ is the same for each I, D\ — Di, say. Thus C — NAr + J2i Di- 
Equating the expected amount of work arriving in a cycle with the expected total 
grant, we have, 

LDi = Pi (s + E A J • 

Summing over i gives an equation for Di, eventually yielding Di = PiS/ (L — p). 
Since, for 1 < i < N, by construction Di < d™ ax we deduce the stated necessary 



condition for stability. Relation ( 10 ) is now straightforward to verify. □ 



4.1.2. One overall OLT schedule. Assume now that the OLT maintains a single 
round robin cycle and instructs the currently considered ONU, ONU i, say, to 
transmit on the next available wavelength taking account of grants issued in previ- 
ous GATE messages. Let A be that wavelength and designate the ONU in question 
by jx ■ With a slight abuse of notation let g\ , s\, d\ and fx denote the correspond- 
ing GATE epoch, start time, grant and finish time, respectively. The finish time 
includes the piggybacked REPORT message and guard time: fx = s x + dx + A R . 
The following proposition defines a feasible and efficient schedule. 



12 



N. ANTUNES, C. FRICKER, PH. ROBERT, AND J. ROBERTS 



Proposition 4. Let gi and Si be the GATE epoch and start time of ONU i, the 
next to be handled in the round robin cycle. The following relations define a feasible 
schedule and ensure all wavelengths are fully utilized: 

(18) g t = g x + d x + A R , 

(19) = 9l +A G + A -Si, 

where 

Aq = 2m&x{5i} + t. 



Proof. It is straightforward to verify that ( 18 ) and ( 19 1 ensure the feasibility con- 
dition 

(20) & + A G +r< s;-^, 

i.e., the GATE arrives at the ONU before the scheduled start time even accounting 
for the maximum possible downstream delay r. In particular, the GATE can always 
be sent before g t + t on the downstream wavelength used for ONU j\ . 

To show the wavelengths are fully utilized we need to prove Si + Si = f\, i.e., 
the transmission from ONU i arrives at the OLT immediately after the previous 



transmission. By (19) applied to ONU j\, we have s\ = g\ + Aq + Aq — Sj.. We 



also have fx = s\ + d\ + 8j x . Substituting for s\ and recognizing from ( 18 ) and 



( 19 1 that Si = g\ + dx + Ac + + Aq — Si, we readily verify that fx = st+Si. □ 



Proposition 5. The full availability WDM EPON with GATE-driven next avail- 
able wavelength scheduling is stable iff 

(21) p + m^( Pz /d™ ax )S < L, 

i 

where S = NAr is the total switchover time per cycle. When the system is stable, 
the mean cycle time is 

(22) C = S 2 /(L-p). 

We omit the proof that closely mirrors that of Proposition [31 The key is again 
that each ONU receives the same number of visits. Instead of we have Vi(t) — 
V(t) /N for 1 < i < N. 

4.1.3. Generalizations. As for single wavelength EPON it is possible to extend the 
above capacity results. Conditions for local stability when some ONUs are unstable 
can be inferred by appropriately redefining switchover times and residual overall 
load. Non-homogeneous switchover times can be taken into account. The service 
cycle need not be round robin and some ONUs can be visited more frequently 
than others. However, crucially, when the OLT operates a separate cycle for each 
wavelength, the cycles should have the same visit frequency for each ONU to enable 



a relation equivalent to (11). The cycle can adaptively omit inactive ONUs to 
economize switchover ties and to alleviate the processing load of the ONU. Note 
that this flexibility is facilitated by GATE-driven scheduling. 

4.2. Limited availability. While the ultimate aim is to evaluate the traffic ca- 
pacity of a hybrid EPON where ONUs have various transmission capabilities, we 
so far have variable partial results when wavelengths are not fully available. 
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4.2.1. Partial sharing. Consider first the case where each ONU is equipped with 
a number of fixed wavelength transmitters that may be less than L. To simplify 
we first ignore the grant limits e?™ aa: so that the underlying service discipline is 
gated polling. In this case the necessary and sufficient conditions for stability do 
not depend on the switchover times and are given by the following proposition. 

Proposition 6. When the service discipline is unlimited-gated, the WDM EPON 
with partial sharing is stable iff, for all subsets O of ONUs, we have 

(23) 5><|£(0)|, 

where C{0) is the set of wavelengths that can serve at least one member of O. 



Proof. Necessity of ( 23 ) is obvious since at least one ONU queue would otherwise 
grow unboundedly. To prove sufficiency, we apply the fluid limit approach using 
the notation of Proposition [3] Recall that IF(t) denotes the set of ONUs that still 
have a fluid backlog at t. Let F l (t) be the subset of T{t) that can use wavelength 
I. Considering that the system is work conserving, we necessarily have, 

(24) G{t) = 1, for I such that f 4>, 

where (j> denotes the empty set. Moreover, for / such that F l {t) ^ cj>, the wavelength 
is necessarily busy serving some ONU from J-(t). It follows that only wavelengths 
I such that J- l (t) = cj) can be serving ONUs not in F{t). We conclude, 

E o(t)= E ^(*)< = E 

where we make use of (fill). Thus 



(25) d(t)= e G 'w+ E G V),= E ft+iw))i- 

l, J 71 (t)=0 I,^' (*)#<£ »£F(t) 



By (121, 



(26) W(t)= E ft-MI- 

i£.F(t) 

and since J-{t) can be any subset of ONUs, we conclude W(t) is negative under 



conditions (23) and the system is therefore stable. □ 



To see the potential impact of limited grants we consider a limiting case where 
switchover times are negligible and the grant limit tends to zero. In this case, 
wavelengths are used in head of line processor sharing mode - each ONU with a 
non-empty queue gains a certain share of the wavelengths it can use. We show by 
a simple example that the natural stability conditions are then not sufficient. 
Example 1. 

Consider the case of two wavelengths l\ and I2 and two ONUs, such that ONU 1 
can transmit on l\ and I2 but ONU 2 can transmit only on I2. A natural condition 
for stability would be 

(27) p 2 < 1 and p x + p 2 < 2. 



We now proceed to show this condition is not sufficient. Assume condition (27) 



holds. If (27l holds, then either p x < 3/2 or p 2 < 1/2. If p x < 3/2, queue 1 empties 
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with probability 1. As above, skipping technical details, let (Wi(t),W2(t) be a 
fluid limit of ONU queue contents. If (W\(0), W 2 (0)) = (0,x) with x > 0, at the 
fluid scale W% is a birth and death process and the probability the queue of ONU 
lis empty is ir = 1 — 2p 1 /3. The fluid equation for (W 2 (t)) is therefore given by 

W 2 {t) = P2 ~ ^(1 - 7To) - 7TQ = ^/Ol +p 2 - 1- 



The necessary and sufficient condition for stability is thus (27) supplemented by 
Pi + 3p 2 < 3. 

This simple example shows that a loss of capacity occurs at queue 1: if Zi were 
used efficiently, l 2 would have only to cope with load max(0, p\ — 1) at queue 1, 
so that the stability condition would be max(0,pi — 1) + p 2 < 1 which is precisely 



condition (27). It may be verified that this would be the case if, instead of equally 
sharing l±, ONU 2 were given priority over ONU 1. This suggests there is scope 
for optimized scheduling but only when the grant size is not large enough that 
Proposition [6] is not a good approximation. 

4.2.2. Tunable transmitters. As with partial sharing, it appears difficult to derive 
analytical formulas for traffic capacity of a WDM EPON where ONUs arc equipped 
with tunable transmitters. The condition ^ is clearly necessary and would provide 
a reasonable capacity approximation when the probability an ONU can use multiple 
wavelengths in heavy traffic is small. Additional necessary conditions are obtained 
on considering each ONU in isolation. We clearly must have for 1 < i < N, 

(28) pi <U — 



X _1_ A ' 

where U is the number of tunable receivers in ONU i. 

To investigate whether these natural conditions might also be sufficient, we con- 
sider another toy example. We again neglect switchover times and suppose the 
maximum grant size becomes very small. 
Example 2. 

Consider the network with two wavelengths l\ and 1% and three ONUs. Each ONU 
can only use one wavelength at a time. The natural stability conditions now read 

(29) max(pi, p 2 , pa) < 1 and p x + p 2 + p3 < 2. 

The model is a multi-server multi-queue processor sharing system like that consid- 
ered in [4]. 



Assume pi > p 2 > P3 and (29) holds. This implies p 3 < 2/3 and the queue 
of ONU 3 empties with probability 1. At equilibrium, when the other ONUs are 
saturated, the probability that this queue is non-empty is given by 1 — ttq = Zp^/2. 
The queue of ONU 2 then receives service at a rate no less than 

2, 1 

-(1 -7T ) +7T = 1 - ~p 3 . 

Consequently, queue 2 empties with probability 1 under the condition p 2 < I — p^/2 



which also holds as a consequence of (29). We conclude that, if (29) holds, then 



the process {W 2 {t) ,Wait)) returns to state (0,0) with probability 1. Hence when 
ONU 1 is saturated, this process is ergodic. Let p be the probability that both 
queues are non-empty under the invariant distribution. The fluid limit equation for 
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(Wi(i)) is then 

(30) W^t) = 5 d = Pl -lp-(l-p) = pi -i+ l p . 

Theorem 3.1 of [1] states that this quantity is negative under the additional condi- 
tion p\ + P2 + P3 < 1- This result can be improved as follows. 

Assume p 2 + p3 < 1. By comparing the process (^(t), W^t)) with the content 
of a simple queue with input rate p 2 + p% and service rate 1, it is straightforward to 
show that Wzif) + Ws(t) is stochastically smaller than the workload of the queue 
and consequently that p is less than the probability this queue is non-empty, i.e., 



P < P2 + P3- In view of (30), 5 is negative if pi — 1 + (p2 + P3)/3 < 0. We deduce 



the following sufficient stability condition 

3^1 + P2 + P3 < 3, P2 + P3 < 1 

which is weaker than the condition p\ + p2 + P3 < 1 of 4J. It remains to establish 



if this condition is also necessary or, indeed, if necessary condition (29) is also 
sufficient. 

Establishing the traffic capacity of the WDM EPON with tunable transmitters 
thus appears as a challenging largely open problem. 



5. Performance 

We evaluate the performance of GATE-driven and REPORT-driven scheduling 
in a long reach EPON. We use the stability limits and the mean cycle time derived 
in Section 4 and 5 for GATED-driven scheduling and simulation to obtain the mean 
packet delay for GATE-driven and the performance of REPORT-driven scheduling. 

5.1. Simulation set up. We consider an EPON with a transmission bit rate of 
1 Gbps per wavelength in both upstream and downstream directions. The total 
number of active ONUs is N — 20 and these generate fixed size 1000 byte packets 
according to a Poisson process. Each ONU has infinite buffer capacity. The size 
of REPORT and GATE messages is 64 bytes and the time guard is 1.5 ps (i.e 
A G = A R m 2.12 ps). One-way propagation delays 8i, i = 1, . . . , N, are uniformly 
distributed between 10 and 500 ps corresponding to distances of between 2-100 km. 

5.2. GATE-driven vs REPORT-driven. We first present results of a classical 
TDM EPON with a single wavelength in each direction. 

Unlimited gated service. We compare GATE-driven and REPORT-driven schedul- 
ing without any limit on the grant size (d-imaxi is very large). Figure [5] shows 
the mean cycle time as a function of total load for symmetric traffic, i.e., all ONUs 
contribute equally to the total traffic load. As expected, both schedulers realize 
the full traffic capacity and the cycle time only becomes significant for very high 
loads (> .95). The cycle time of REPORT-driven scheduling is never smaller than 
lms, the longest round trip propagation tile of any ONU. This propagation time 
is experienced as a minimal delay for GATE-driven scheduling, as shown in Figure 
[6] This delay, excluding the transmission time, occurs even at low load due to the 
two stage reporting. Delays for REPORT-driven scheduling are 50% higher than 
for GATE-driven. 



16 



N. ANTUNES, C. FRICKER, PH. ROBERT, AND J. ROBERTS 




Figure 5. Mean cycle time with unlimited gated service (one wavelength). 
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Figure 6. Mean packet delay with unlimited gated service (one wavelength). 



Limited gated service. Figures [7] and [8] depict the mean packet delay of GATE- 
and REPORT-driven scheduling, respectively, for limited gated grant sizing with 
d™" 1 corresponding to 2, 4 and 6 Kbytes. The vertical lines in Figure [7] plot the 
stability limits given by equation ([6]). We see that the more efficient utilization 
of the upstream channel of the GATE-driven scheduling increases the stability 
limit substantially, especially for small maximum grant sizes. We also observe that 
GATE-driven scheduling presetrves the 50% delay advantage at low and medium 
loads. Mean cycle times are compared in Figure [9j The cycle time for REPORT- 
driven is again determined mainly by the propagation time with a slight increase 
as load tens to the capacity. The maximum cycle time for GATE-driven is equal to 
N(A R + d max ). 

5.3. Full availability WDM scheduling. We present results the full availability 
WDM PON using a single round robin cycle per OLT. Results with one cycle per 
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Figure 7. Mean packet delay with limited gated service for 
GATE-driven (one wavelength) 
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Figure 8. Mean packet delay with limited gated service for 
RREPORT-driven (one wavelength) 



wavelength are very similar. Figure [10] represents the mean packet delay for GATE- 
driven scheduling for 2 and 3 wavelengths with a maximum grant size of lOKbytes. 
We consider symmetric and asymmetric traffic loads. In the asymmetric traffic 
case, we set the arrival rate of 15 ONUs to 7 and that of the 5 other ONUs 57 
and vary 7 to cover the load range. The figure plots the mean delay of the most 
heavily loaded ONUs. The vertical lines are the stability limits for both traffic 
scenarios obtained through equation |6]). The mean packet delay remains around 1 
ms up to loads very close to capacity confirming that this stability limit is indeed 
the essential performance parameter for these systems. Figure [IT] gives the mean 
cycle time for symmetric traffic depicting the maximum attained when every ONU 
transmits a maximal grant per cycle. 
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Figure 9. Mean cycle time with limited gated service (one wavelength) 
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Figure 10. Mean packet delay with limited grant sizing for 
GATE-driven (full wavelengths availability) 



5.4. Limited availability WDM scheduling. Figure |l2"| shows results for a lim- 
ited availability system where ONUs are equipped with a tunable transmitter. We 
consider 3 wavelengths and set the grant size gated grant sizing to 8 kbytes. The 
number of active ONUs is now 4 and 20 and we consider both symmetric and asym- 
metric traffic. The asymmetry is such that 75% of ONUs have load 7 and 25% have 
load 5 7. Vertical lines on the figure correspond to the capacity estimated using 



formula (21 1 giving the capacity of a corresponding full availability system. For the 



20 ONU network, these analytical results are a good approximation mainly because 
in heavy load each ONU rarely has the opportunity to use more than one wave- 



length. For only 4 ONUs with traffic imbalance, formula (21) fails to approximate 
capacity. In fact, for this case, capacity is determined by the additional necessary 
condition < d™ ax j{d™ ax + A R ) applied to the single heavily loaded ONU giving 
a capacity of 1.58 Gb/s. 
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Figure 1 1 . Mean cycle time with limited grant sizing for GATE- 
driven (full wavelength availability) 
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Figure 12. Mean packet delay with limited grant sizing for 
GATE-driven (limited availability) 



6. Conclusions 

The proposed GATE-driven DBA algorithm for long-reach WDM EPONs has 
been shown to have attractive properties in terms of both performance and flexibil- 
ity. Traffic capacity is maximized while the mean packet delay is lower than that 
obtained with previously proposed online and offline schedulers. The algorithm is 
simple to implement and conforms to existing EPON standards. The algorithm 
allows flexible periodic service cycles that could be adapted dynamically depend- 
ing on observed ONU activity. We have not yet explored the full potential of this 
flexibility. 

We have derived closed-form traffic capacity formulas for a WDM EPON where 
all wavelengths are fully available to the ONUs. These formulas can constitute 



20 



N. ANTUNES, C. FRICKER, PH. ROBERT, AND J. ROBERTS 



good approximations for more practically interesting networks where ONUs have 
limited access to wavelengths or can only use one wavelength at a time. 

There is clearly considerable scope for further work. We would like to explore 
how one can exploit the potential for flexible scheduling to realize effective inter- 
ONU differentiation. It would be useful also to perform evaluations that confirm 
our belief that simple online scheduling is adequate to meet QoS requirements. In 
other words, we intend to show that per flow latency and throughput requirements 
can best be realized by intra-ONU differentiation that is independent of the DBA. 
It remains, of course, to more completely analyze the hybrid WDM EPON to derive 
useful dimensioning methods that properly account for the different capabilities of 
legacy and evolved ONUs that are controlled by the OLT. 
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